Approximate simulation of quantum channels 
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In Ref. we proved a duality between two optimizations problems. The primary one is, given 
two quantum channels M and A/", to find a quantum channel TZ such that TZoM is optimally close to 
M as measured by the worst-case entanglement fidelity. The dual problem involves the information 
obtained by the environment through the so-called complementary channels M and M, and consists 
in finding a quantum channel TZ' such that TZ' o M is optimally close to N ■ It turns out to be easier 
to find an approximate solution to the dual problem in certain important situations, notably when 
M is the identity channel — the problem of quantum error correction — yielding a good near-optimal 
worst-case entanglement fidelity as well as the corresponding near-optimal correcting channel. Here 
we provide more detailed proofs of these results. In addition, we generalize the main theorem to 
the case where there are certain constraints on the implementation of TZ, namely on the number of 
Kraus operators. We also offer a simple algebraic form for the near-optimal correction channel in 
the case M = id. For approximate error correction, we show that any e-correctable channel is, up 
to appending an ancilla, e-close to an exactly correctable one. We also demonstrate an application 
of our theorem to the problem of minimax state discrimination. 
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Shannon theory and error correction, be their classical 
or quantum version, are based on the problem of trans- 
mitting information through a given noisy channel M by 
choosing an encoding channel £ and a decoding channel 
TZ such that when composed, they simulate a noiseless 
channel id: TZAfE « id. For instance, the capacity of a 
channel Af is the largest ratio n/m such that J\f^™ can 
perfectly simulate id®" in the limit n — >■ oo . 

Classically, the problem of simulating a noiseless chan- 
nel has been found to provide the benchmark for most of 
a channel's information carrying capabilities. In quan- 
tum information theory, however, the situation is more 
complex. For instance, the capacity of a quantum chan- 
nel defined in this way — the quantum capacity — does not 
suffice to determine the capacity of channels used in con- 
junction with each other [3|. 

More generally, one can consider the simulation of an 
arbitrary channel A4: TZAfS ~ Ai. For example, if Af 
is quantum, choosing the target A4 — id®" yields the 
quantum capacity, while using copies of a fully decoher- 
ent (i.e., classical) channel 
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yields the classical capacity. A slightly more general case 
is that where is a noiseless channel on any C*-algebra, 
which yields subsystem quantum error correction (QEC) 
and hybrid quantum-classical error correction Q, or hy- 
brid capacities in the asymptotic case @. 

A fundamental result in the case of standard QEC, 
namely, simulation of the identity quantum channel, is 
the Knill-Laflamme conditions [6], which, given an en- 
coding £, provide a criterion for the existence of a corre- 
sponding decoding channel TZ. Specifically, it says that 
TZ exists, i.e., the channel and code are correctable, if 
and only if the environment gains no information about 



the encoded state. This condition, in its approximate 
form ^ is also the main starting point for the "de- 
coupling" approach to channel capacities Q, namely, 
the corresponding result applied to states via the Choi- 
Jamiolkowski isomorphism. 

Here we detail and extend work presented in Ref. [l| 
which generalizes these results to a generic target A4 . In 
Section |TT1 we prove two slightly more general versions 
of the main theorem of Ref. [ij , which says that the op- 
timal distance (optimized over the decoding operations 
TZ) between TZAf and A4 is equal to the optimal distance 
between Af and TZA4, where the hat denotes respective 
complementary channels. This yields an efficient way of 
estimating the optimal distance for a large class of tar- 
get channels A4 (Section IIII Ap . We also show how to 
explicitly construct a channel TZ achieving the estimated 
distance (Section IIITB|) . In Section UlTCl we show that 
any e-correctable channel is, up to appending an ancilla, 
e-close to an exactly correctable one. We also give an ap- 
plication of our main theorem to the problem of minimax 
state discrimination in Section IIIIDI 

We note that our approach also yields new results in 
the important special case AA — id. In order to measure 
the quality of a simulation, we use a fidelity-based dis- 
tance as in Refs. [§, In contrast to these works, how- 
ever, our approach yields an approximate reversal chan- 
nel for the worst-case entanglement fidelity which is a 
state-independent measure. See Section IIVI for a com- 
parison with these works in the case AA = id. 

Other works focused on the worst-case trace dis- 
tance [7|, illi] . The advantage of the fidelity over these is 
that the optimal fidelity in the dual problem is precisely 
equal to the optimal fidelity in the original problem. Al- 
though the dual problem may not be solved easily, it can 
be precisely estimated. This yields better bounds on the 
original optimization which are useful not only for an- 
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alyzing asymptotic scenarios, but may be advantageous 
also in one-shot scenarios, e.g., to estimate the error in an 
approximate quantum error correction scheme [l^ - [l7j . 

In addition, since our results concern only the prob- 
lem of finding TZ such that TZAf' « M, they are not re- 
stricted to the case where JV' = Af£ but apply also to the 
case where resources are shared between the sender and 
the reci pien t. For instance, for an entanglement-assisted 
scheme |l8l . [loj one would consider a channel of the form 



I. PRELIMINARIES 

In this paper, we restrict our considerations to finite- 
dimensional systems. We begin by introducing the main 
concepts that we will be using to express our results. 

A. Complementary channels 

A channel A/" is a completely positive trace-preserving 
map. It can always be written as 

M{p)=Y,ErpEl (3) 

i 

where the Kraus operators Ei satisfy J^i ^l^i — 1- Con- 
versely, any function of this form is completely positive 
and trace-preserving. The dual N"^ is defined by the re- 
lation 

Tr(AA(p)A)=Tr(pAAt(A)) (4) 
for any state p and any operator A. This implies that 

U\A)=Y,ElpE,. (5) 

i 

Physically, N is interpreted as evolving states, while N'^ 
evolves observables. Hence, A/"^ represents the Heisen- 
berg picture for the evolution defined by the channel. To 
avoid confusion, we only call M a channel, while A/"^ is 
its dual. 

Note that the dimension of the input and output quan- 
tum systems may differ. In this case the operators Ei 
are not square matrices. For instance, there is only one 
channel whose output Hilbert space has dimension one: 
the trace. Channels with a one-dimensional input are in 
one-to-one correspondence with quantum states. Since 
the input is just a complex number z, the output can 
only be of the form zp for some fixed quantum state p. 

We let the reader check that the dual of the trace chan- 
nel takes as an input a complex number a (an operator 
on the one-dimensional Hilbert space), and outputs the 
identity operator times a: 

Tr^(a)=al. (6) 



Note that the partial trace can be written as id ® Tr, 
where id is the identity channel on the subsystem that is 
not traced over. Hence, for instance, 

(id(g)Tr)^(A) = (7) 

Stinespring's dilation theorem f2(l| guarantees that for 
any channel M we can find a (nonunique) isometry V 
which maps the input space of Af into its output space 
extended by an extra system E, the "environment" , such 
that 

J^\A) = V\A<E,1e)V, (8) 

or equivalently, 

A/-(p) = (id®TrE)(ypyt)^ (9) 

where id® Tr^; denotes the partial trace over E. We say 
that V defines a dilation of A^. 

An isometry V is any operator satisfying the property 
V''V = 1. It describes how the input Hilbert space is 
isometrically embedded in the target space. It can be 
seen as a unitary operator but with the input restricted 
to a subspace. For instance, if the target space dimension 
is divisible by the input space dimension, it amounts to 
adding an auxiliary system with a fixed pure initial state 
\(f>Q) and letting it interact unitarily with the system, i.e., 

[/(IV^) ® |0o)) (10) 

for some unitary operator U. 

It is easy to see that this isometry V is not unique. 
Indeed we can always use another isometry W from E to 
a larger environment E' to obtain also 

Af(p) = {id (g>TYE'){V'p{Vy), (11) 

where V' is the new isometry 

y' = (1 (g) w)v. (12) 

Definition 1. Let V define a dilation of the channel Af 
as above. We say that the channel JV, defined by 

Af^B) ^ V\l(g} B)V (13) 
for all B, is complementary to Af. 
Equivalently, we have 

^/■(p) = (Tr®id)(yp0). (14) 

The channel Af maps the initial state of the system to 
the final state of the environment. 

It is clear from the definition that Af is also comple- 
mentary to Af. 

Since the isometry V associated with Af is not unique 
as observed above, there are correspondingly many chan- 
nels complementary to Af. For instance, if Af maps the 
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input system to the environment E, and W is an isome- 
try from E to E' , then the channel Af' defined by 

{U'yiB) = iV')^{l<i^B)V', (15) 

where V' = {1 (E) W)V, is also complementary to Af. 

The connection between the isometry V and the chan- 
nels' Kraus operators can be found by introducing an 
orthonormal basis \i) of the environment as follows: 

Af^A) ^V^{A(g,l)V ^Y.vHA(g,\i){i\)V 

(16) 

= ^y^(l(8) |z))A(l(g) (i|)K 

i 

Hence, we can use 

E, = (10 {i\)V, (17) 

which is defined by 

{i;\E, = ((^1 <g> {i\)V. (18) 

This implies that the complementary channel associated 
with the isometry V can be written in a dual form as 

A/-t(i3)=^(z|i3|j>i?t£;^-. (19) 

B. Minimal dimension of the environment 

Definition 2. Given a channel Af, we write \Af\ for the 
minimal number of Kraus operators with which it can be 
represented. This corresponds to the minimal dimension 
of the environment for which it has an isometric imple- 
mentation (Eq. dS])). 

If the channel Af maps states over the Hilbert space 
Ha to states over Hb, then [2lj 

l<\Af\< dimCHA) dimCHs). (20) 

In addition, \Af \ = 1 if and only if 

Af{p) = VpV'' (21) 

for some isometry V. 

C. Postprocessing order and equivalence relation 

Definition 3. Given two channels Af and A4, we write 

AfhM (22) 

if there exists a channel TZ such that TZAf — A4 (i.e., 
Tl{N{p)) = M{p) for all states p). We also write 

N ^AA (23) 

when we have both Af M and Ai>Af. 



Note that this relation y can equivalently be defined 
dropping the requirement that TZ be trace-preserving. In- 
deed, TZ can always be completed to the trace-preserving 
channel 

TZ'ip) = TZ{p) + Tr[(l - TZ\l))p]a, (24) 

where a is an arbitrary state, such that TZ'N = A4. 

It is easy to see that the relation ^ is a preorder on all 
channels, and that ~ is an equivalence relation. 

The relation Af ^ AA can be seen as a precise way 
of saying that the two channels carry the same informa- 
tion about the initial system, independently of further 
processing. For this reason, we will mostly focus on the 
equivalence classes rather than on individual channels. 

Note that Af A4 does not imply that Af and AA 
are related by a unitary map. For instance, consider the 
family of channels Sr{p) '■= Tr(p)T. These channels are 
all equivalent since Sr = SrSr'- But they can be related 
by a unitary map if and only if r and r' have the same 
spectrum. 

Let us state a few elementary facts in order to build 
up some intuition about this relation. All channels are 
bounded from above by the unitary channels (in partic- 
ular), which are all equivalent to the identity channel 
id, and from below by the channels Sr mentioned above, 
which are all equivalent to the trace Tr (which is the only 
channel with a target space of dimension 1): 

id hAfh Tr. (25) 

Note that for any channels Af, AA, Af' , and A4' , 

Af (E) Af' h A4 (E) A4' Af h M and Af' h A4' . (26) 

In particular, 

Af hAA <=^Af®idh AA® id. (27) 

Since channels are trace-preserving, we always have 

TrW = Tr. (28) 

Let us show that all the channels complementary to a 
given one belong to the same equivalence class. 

Lemma 1. IfAf and Af' are both complementary to Af , 
then Af --^ Af' . 

Proof. Suppose without loss of generality that the dimen- 
sion of the output of Af' is larger or equal to that of Af. 
Then there is an isometry W such that 

J^'(p) = WAf(p)W'f (29) 

for all p. Hence, clearly, Af' > Af . In order to show that 
also Af > Af' , we need to build a channel TZ such that 
Af = TZAf'. Since V^V = 1, we would like to use the 
completely positive map p V'^ pV . Unfortunately, this 
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map is not trace-preserving. Instead, letting P := VV\ 
we use 



n{p) = V^'pV + Triil- P)p). 



(30) 



It is easy to check that this map is trace-preserving. Fur- 
thermore, since (1 — P)V ~ 0, the extra term does not 
affect the abihty of TZ to invert J\f' . □ 
Note that the equivalence class associated with J\f is in 
general larger than the set of channels complementary to 
Af. We will regard the channels belonging to this class 
as generalized complementary channels and will typically 
denote them by a tilde, i.e., as Af. 

Definition 4. The channel A/" is a generalized comple- 
mentary channel of Af ii Af ^ Af, where Af is comple- 
mentary to Af. 

The generalized complementary channels satisfy the 
following important property: 

Ttieorem 2. If Af and A4 are generalized complemen- 
tary channels of Af and Ad, respectively, then 



N> M 



M>N. 



Proof. It is sufficient to show that N > AA. 
For any channel Af , we will write 



(31) 
MhAT. 



(32) 



for an isometric map characterizing a dilation of Af , i.e., 
such that Af = (id ® Tr)Vjs/. Note that if 7?, is a general- 
ized complementary channel of TZ., then [TZ ® id)VV is a 
generalized complementary channel of TZAf . 

li Af > AA, then there is a channel TZ such that TZN = 
A4. It follows that 

Mr^ {TZ(g>id)Vj^. (33) 

But then 

A/" - (Tr ® id)V^r = (TtU ® id)V^r - (Tr (g) id)M, (34) 
which implies that A4 >z Af. □ 

D. Exact error correction 

Theorem [5] directly yields the Knill-Laflamme condi- 
tions for exact quantum error correction on standard 
(subspace) codes. Suppose that the "noise" on the phys- 
ical Hilbert space T-L is modeled by a channel with Kraus 
operators Ei. A standard code can be seen as a sub- 
space of "H, or, equivalently, as an isometric encoding 
map V : Tic ^ H from the logical space He to the 
physical space H. The projector VV^ — P projects on a 
subspace of H isomorphic to He, which is usually called 
itself the "code" . The channel to consider then is 



(35) 



from He to H. It implements the encoding followed by 
the noise. Given the code P (or equivalently the isometric 
encoding V) , the problem of quantum error correction is 
to find a correction channel TZ mapping H back to He 
such that 



7^A/' = idc. 



(36) 



where idc is the identity map on the logical space He. 

Hence, we can directly apply Theorem [2] with A4 — 
ide. It is easy to see that all channels complementary to 
idc are of the form 



|^)(^|Tr(p) 



(37) 



for an arbitrary pure state {ip) living in a space of ar- 
bitrary dimension. In addition, one can show that all 
channels similar to such complementary channels are of 
the form 



p ^ tTt{p) 



(38) 



for some mixed state r, i.e., Eq. (|55| describes the gen- 
eralized complementary channels to idc- Picking any of 
these channels. Theorem [2] states that the existence of a 
channel TZ satisfying Eq. (|36l) is equivalent to the exis- 
tence of a channel TZ' such that 



AA(p)=7^'(r)Tr(p). 



(39) 



But since r is fixed, this is equivalent to the existence of 
a state a such that 



Af{p)=aTr{p) 



(40) 



for any state p. This is the Knill-Laflamme condition. 
This is most easily seen from its dual (Heisenberg picture) 
form, which reads 



Af' {A) = TT{Aa)l 



(41) 



for all operators A. It is enough to check this condition 
for a basis \i){j\ of the space of operators, which yields 



MH\^){J\) = V^E}E,V = {J\a\^)l. 



(42) 



Multiplying on the left and on the right by V and , 
respectively, we obtain 



pe}e,p = K,p, 



(43) 



where Ay — {j\a\i), which is the Knill-Laflamme condi- 
tion in its most familiar form. 

Moreover, we also obtain the generalized Knill- 
Laflamme conditions for the correctability of subsystem 
codes [22 2^], or in fact any algebra, of which subsystem 
codes are a special case |4|. A f-algebra (or algebra for 
short) is a set of operators closed under multiplication 
and which also contains the adjoint of all its elements. 
For instance, suppose that our Hilbert space H is divided 
into two subsystems: H = Ha ^ Hb- Then consider the 
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set A of operators of the form A(E) 1, where A is an op- 
erator on "Hyi and 1 the identity on T-Lb- It is trivial to 
show that A is an algebra. It represents all the local 
observables acting on Hi- In fact this is close to being 
the most general form of a f-algebra. For any f-algebra 
A we can find a decomposition of the Hilbert space into 
orthogonal subspaces Hi which are left invariant by all 
elements of the algebra. Furthermore, when restricted 
to any of these invariant subspaces, the algebra has pre- 
cisely the form described in the above example. Hence, 
the algebra defines a set of subsystems living in a family 
of orthogonal subspaces. This means that any element 
A G ^ is of the form 



A = '^A,(x)li 



(44) 



where Ai (g) 1^ is an operator supported on Hi. Said 
differently, if Pi is the projector on Hi, then PiAPi = 
A, ^ li. 

A handy tool is the projector Va on this algebra, which 
we take to be orthogonal in terms of the Hilbert-Schmidt 
inner product between operators. This is a quantum 
channel satisfying = Va = whose range is pre- 
cisely A. It has the following explicit form: 



VA{p)=Y.^r^^P^pP^)' 



(45) 



where Tr2 is the partial trace over the second subsys- 
tem of the ith subspace, and is the dimension of that 
subsystem. 

We say that an algebra A is correctable for the channel 
N if there exists a "correction" channel TZ such that for 
all A^A, 



(7^oAA)t(A) 



(46) 



Note that A contains the spectral projectors of any 
observable A £ A. Hence, this definition implies that 
measuring A before the action of the channel A/" or after 
the correction will yield the same probabilities, no matter 
what the initial state was. 

Clearly, Eq. gS]) implies that 'Pa°'R-°-N' = Va- Hence, 
an equivalent formulation is to require the existence of a 
(possibly different) channel TZ such that 



UoJV = Va- 



(47) 



This puts the problem in a form suitable for the appli- 
cation of Theorem [21 since it says that JV >: Va- If -4' 
denotes the algebra formed by all operators commuting 
with all elements of A (i.e., the commutant of A), one 
can show that the channel Pa' is a generalized comple- 
mentary channel of Pa ■ Elements of the commutant have 
the form B = li®Bi for the same decomposition into 
orthogonal subspaces. Explicitly, 



(48) 



where Tri is the partial trace over the first subsystem of 
the «th subspace, and rii is the dimension of that subsys- 
tem. 

Hence, Theorem [5] states that the existence of TZ that 
satisfies Eq. (|47)) is equivalent to the existence of TV such 
that 



N = U'Va' 



(49) 



But since Va' is a projector, this is equivalent to requir- 
ing that 



jv = jvrA', 

or, in the Heisenberg picture, 

^VA'j^^'- 



rn 



(51) 



The latter is equivalent to requiring that for all operators 
A, 

AfHA) e A', (52) 

or, for all i, j, assuming M has the form previously used, 

N■\\^){J\) = V^E}E,VeA'. (53) 

This is the form of the conditions derived, using a differ- 
ent method, in Ref. 

II. MAIN THEOREM 



Let /(p, a) = Tr y^'^/pa^/p be the fidelity j25| between 
states p and a. For reasons that will become clear, we 
extend the definition of this function to all positive op- 
erators p or (T of trace smaller than or equal to one (note 
that for operators of trace smaller than one, /(p, a) does 
not have the meaning of fidelity). Even for this more 
general concept, Uhlmann's theorem [25| holds, i.e, we 
have the alternative expression 



/(p,a)=max|(^p|(l0y)|V'.)|, 



(54) 



where {ipp) and lip^) are any purifications of p and cr, re- 
spectively, and the maximization runs over all isometric 
operators V between the extra reference systems. (Note 
that here either V^'V = 1 or W'^ — 1 depending on 
which of the two purifications is of larger dimension). 
Since the quantity f{p,a) is real, we can optimize the 
real part rather then the absolute value of the expression 
('(/'pKl® V^)|V'cr)- In addition, one can show that the opti- 
mization can be done over all operators of norm smaller 
than one rather than just the isometric operators, i.e.. 



/(p,cr) = max Re(i/;p|(l(g)v4)|7/'^) 

h4 < 1 



max Re w), 

\\A\\<\ 



A 



(55) 
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The diagram can be thought of as a circuit where the 
boxes are operators which are not necessarily unitary, 
nor even square matrices. The left half circles repre- 
sent input states, while the right half circles represent 
states which are scalar-multiplied with the correspond- 
ing ouputs. The above diagram thus represents the scalar 
product between the output of the circuit, (1 (g) A)\4>a), 
and the state iV'p)- 

For a given state p, we introduce the "entanglement 
fidelity" between channels A/" and A4 , 

F,(AA,A^) = /((AA®id)(|V)(VI),(X®id)(|^)(V'l)) 



max Re 

ll^ll<i 



A 



V, 



M 



where \ip) is a purification of p. When Ai = id, this 
quantity reduces to the square root of Schumacher's en- 
tanglement fidelity of M [2a] . We will compare channels 
using the worst-case entanglement fidelity. 



F{U,M) 



mm Fp{J\f,M), 
p 



(56) 



which was studied in Ref. 121 



Theorem 3. IfM and M are complementary to Af and 
Ai, respectively, then for any d > 1, 



max FlnN.M) 

|7^|<c^ 



max F(Af,n'M), (57) 

\n'\<d 



where the maxima are over all trace-nonincreasing com- 
pletely positive maps with the appropriate source and tar- 
get spaces, and \TZ\ stands for the minimal number of 
Kraus operators for TZ. 

Proof. The proof closely follows arguments used in 
Ref. Let Vj^ be the isometry for which Af{p) = 

TTEiVxpV^) and JV{p) = TrsiV^pV^), and Vm be the 
isometry yielding both Ai and Ai in the same way. Note 
that any trace-nonincreasing channel TZ can be written 
as TZ{p) = Tr ^{ApA^) for some operator A satisfying 
\\A\\ < 1 from the input Hilbert space of TZ to its output 
space tensored with an "environment" E. Using this fact 
and writing the fidelity using Eq. ()55p . we obtain 



the target systems for Af and Ai , respectively, and E and 
E' are the respective "environments". The state \ipp) can 
be any purification of p. If we reflect the picture with 
respect to a vertical axis through the middle, Hermitian 
conjugating each operator [this amounts to a complex 
conjugation of gp{A,A')], and exchange the wire labels 
E' and B, and E and B', we see that we also have 

max F(Af,TZ'Ai) — max min max Re qo(A,A'), 

|TC'|<rf II^'II<1 P \\A\\<1 

(60) 

where now A' is the operator defining TZ' while A comes 
from Eq. (|55p for the fidelity. Hence, we just have to show 
that we can exchange the maximizations over A and A' 
in Eq. ([55]) . This can be done by applying Shiffman's 
minimax theorem [2^ which says that we can exchange 
the rightmost min and max provided that the function 
is convex-concave in the two arguments (in this case it 
is bilinear), and that the variables are optimized over 
convex sets, which is the case here. Hence, we obtain 
max-Tj F{TZN, M) = may^A' max^ miup Re gp{A, A') = 
maxTj' F{U,TZ'M), where P|| < 1. □ 

Proposition 4. If N' ^ Af and M' ^ M, then 

mayiF{TZU,M) ^m&yiF{TZW ,M'), (61) 

where the maxima are over all quantum channels with the 
appropriate source and target spaces. 

Proof. We have N' = SN , N = S'N', Af' = TAf, Af = 
T'Af' for some channels S,S' ,T,T' ■ Hence, 

maxF(7V,7^7W) = F{Af,TZaA4) 



< F{SAf,STZQM) 
= F{N',STZoT'M') 

< maxF{Af',TZA4'). 

The converse inequality follows in the same way. 



(62) 



□ 



Proof. Suppose that Af maps operators over T-La to oper- 
ators over Ti b , and Ai maps operators over TIa to oper- 
ators over Tic- Then, using Proposition |4] together with 
the main theorem for d maximal, i.e., 



max F(TZAf,M) = max min max Re qo(A,A'), (58) 

|K|<rf ^ ' \\A\\<1 p \\A'\\<1 ^'^^ ' /' V ^ 

where <?„ can be expressed in terms of a circuit: 



9p{A,A')^ 









B' 








A 












E 







(59) 

The small d indicates that the wire below it represents 
a Hilbert space of dimension d, namely, the system E 
mentioned above. The wires labeled B and B' represent 



d = dim{y.Af dim(HB) dim(?^c), 



(63) 



we obtain a variation of Theorem |3] which we will find 
most useful, and which is the direct generalization of 
Theorem H 

Corollary 5. If Af and A4 are generalized complemen- 
tary channels of Af and Ai , respectively, then 

maxF{TZJ\f,A4) = maxF{Af,TZ'M), (64) 

where the maxima are over quantum channels with the 
appropriate source and target spaces. 
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Proof. The only thing we have to show is that the max- 
ima taken without constraint on the number of Kraus 
operators are always attained by trace-preserving maps, 
i.e. quantum channels. We use the fact that for any 
states p,T, and a, and < p < 1, we have from the 
strong concavity of the fidelity "2^ that 

fipp + (1 - p)t, a) > VP /(P, ct) = f{pp, a). (65) 

Suppose that 72. is a trace-nonincreasing completely pos- 
itive map. We can always "complete" it to a trace- 
preserving channel TZ = TZ + S, where S is another 
completely positive map. For example, one can take 
S{p) — Tr(p — TZ{p))t for some state t. We then have, 
using the shorthands Af'^ = M 0id and ip = 

F{nj\f,M) ==min/[(W)"(V'),7W'(V')] 

= /[(W)^(V'o),X^(V'o)] 

= flillUni^o) + (5A/-)«(Vo),A^^(^o)] 

>/[(7^AA)^(Vo),^^^(V'o)] 

> min/[(7^A/')"(?^),X'=(V')] 

= F{TZM,M). 

(66) 

The same argument works for the right-hand side of 
Eq. dMl)- □ 

III. SPECIAL CASE M'^ = M 

Theorem [3] and Corollary [5] might not seem directly 
useful since they express one optimization in terms of a 
different but seemingly equally hard one. However, we 
will show that there are interesting problems, in partic- 
ular error correction and minimax state discrimination, 
where one of the optimizations can be given a general 
and straightforward near-optimal solution. More gener- 
ally, we will consider the case where 

= M. (67) 

One can readily check that this can indeed be satisfied 
in the special case where = 1, i.e., quantum error 
correction. 



which satisfies the triangle inequality: 

d{Af,M) = max dp{Af,M) 
p 

<mSix[dp{Af,n) +dp{n,M)] (69) 

< d{Af,n) +d{n,M). 

The equation in Corollary [5] in terms of this distance 
becomes 

min d{nAf, M ) = min d{Af, U'M) . (70) 

Corollary 6. If M and M are generalized complemen- 
tary channels of Af and M., respectively, and Ai^ — M., 
then 

^d{Af,AfM) < mmd{n.Af,M) < d{Af,AfM). (71) 

Proof. The rightmost inequality follows from picking the 
suboptimal TZ' = N . For the leftmost inequality, suppose 
that 72.Q minimizes d{JV,TZ'A4). Then, using the triangle 
inequality, 

d{Af, NM) < d{N, TZ'oM) + d{TZ'oM, NM) . (72) 

Let Eo '■— mrn-ji d{TZJV, M). We know that the first term 
is equal to Sq since TZ'q is optimal. For the second term, 
note that 

d{TZ'oM,AfM) = d{n'aM'^,NM) < d{TZ'oM,Sf) = eo- 

(73) 

For the last inequality, we used the fact that 

d{J\fTZ, MTZ) < d{Af, M) (74) 

for any channels Af, A4, and TZ. This property follows 
from the fact that TZ simply limits the number of in- 
put states over which the maximum is taken inside the 
definition of the distance. It follows that d{Af,AfA4) < 

2eo. ^ ° 

Note that computing d{Af,AfA4) requires a convex 
maximization over inputs only (27| , which is a significant 
simplification over the minimax mm-fi d{TZAf , A4) . 

B. Near-optimal recovery channels 



A. Near-optimal bounds 

We will concentrate here on the form of our theorem 
given in Corollary [Sj however, it is straightforward to 
apply the same reasoning to Theorem [3] 

In this section, we will replace the worst-case entan- 
glement fidelity F{Af,A4) by a distance d{Af,A4). From 
the Bures distance [30|, we can define dp{Af,M) — 
a/1 — Fp{Af,M), which can be used to define 

d{Af,M) ■.^m&xdp{Af,M) = ^/l-F{A^,M) (68) 



Let us show how we can construct a recovery channel 
TZg which performs as well as guaranteed by our bounds 
(Eq. dZlD), i.e., 

d{TZgA^,A4) < d{N,NM). (75) 

If we take A4 ^ AA io he complementary to A4 and 
Af = Af to he complementary to Af, then the fidelity 
F{Af,AfM) 1 - d{Af,AfMY is given through a mini- 
mization and a maximization as 

F{Af,AfM) = min max Re gp{A, [/'), (76) 

p ||A||<i 
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where the bihnear function gp is defined in Eq. (l59l) . and 

U' yields TV" through 7^(/9) = Tr2[C/'(p «) |0)(0|)(;7')^]- 

The minimax theorem guarantees that there exists a 
saddle point, i.e., a pair (po,^o) such that we have both 

Re (Ao, U') = min ^max^ Re gp{A, U') (77) 

and 

Re 5po(Ao,t/') = max minRe gp{A,U'). (78) 

||A||<i p 

If we know this saddle point, then, defining the trace- 
nonincreasing completely positive map 

5(p) :=Tr2(Ao(p® |0)(0|)At) (79) 
and completing it to the trace-preserving channel 

ng{p)^S{p)+Tr{p-S{p))T (80) 

for some state t, we have 

FiJlgNM) > F{SU,M) 

= min max Re gp{Ao, U') 



= max minRe Q„(An, A') 

||A'||<1 P ^'^^ ' 

> minRe gp{Ao, U') 
p 

= Re gp„(Ao,[/') 



(81) 



i.e.. Tig satisfies Eq. ((75|) . Hence, a near-optimal cor- 
rection channel 72.g is given by the saddle point in the 
minimax problem yielding the estimate F{M iM M). For 
completeness, suppose that instead of M. complemen- 
tary to }A , we use a generalized complementary channel 
M. M.. Let M' be complementary to M.. Using The- 
orem [5] we obtain J\A' ^ A^. As above, we can build VJ^ 
such that 



d{n'Af,M') < d{Af,AfM). 



(82) 



Suppose that T' is such that M = T'M'. Then using 



TZg := T'TZ'g, we obtain 



d{ngN,M) < d{n'gAf,M') < d{N,NM). (83) 
If furthermore A/" ^ A/", we have 

diUgN, M) < d{N,NM) < d{N,KfM) (84) 

by the monotonicity of the distance. 

Let us now focus on the problem of finding the saddle 
point in the case where 



M{p) ^ p ® (J 



(85) 



and we use the complementary channel 
M{p) = aTr(p). 



(86) 



We use a channel M slightly more general than for pure 
quantum error correction so that we can use for Ai the 
most general channel similar to a channel complementary 
to the identity. This means that to simulate the identity 
rather than this channel M, we can just use the near- 
optimal channel TZg that we will obtain and trace out 
the extra state a. 
We also write 



M{p)^y^E,VpV^El 



(87) 



as m Section llDl 

Assuming a = '^jPj\j){j\, and writing a purification 
of a as \tl}) = y/pi\i)A ® |*)_r, we define the operator 



1 



4 



P 



E 



p — 



(We note that the operator Xp defined here is different 
from the one denoted by the same symbol in Ref. flj.) 
One can check that 



gp(A,t/')-Tr[^(X>|0))], 



(89) 



where the state |0) is the state used before to relate A to 
a completely positive map S. Since this state is arbitrary, 
we can just absorb it in the definition of a new operator 
A, which is now not given by a square matrix anymore, 
and we write simply 



gp(A,[/')-Tr(AXt), 



(90) 



where the completely positive map S is obtained from A 

by 



S{p) -^TTB^ApA^). 



(91) 



In terms of Xp, we then have 



F{M,MM) = min max ReTr(AX^) minTr|Xp|, 



P \\A\\<\ 



(92) 
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where 



XpXp. 



The second equahty in Eq. (|92 
niax||A||<iReTr(A;^t) ^ Tr|X 



(93) 

uses the fact that 
which can be seen 
as follows. Let Xp = ^l-'^pl be a polar decomposi- 
tion of Xp. Then ReTr(AXt) = RcTt{A\Xp\V^) = 
ReTT{V'<A\Xp\). Since < < 1 and 

\Xp\ > 0, we have (see Appendix El ReTT{V^ A\Xp\) < 
Tr\Xp\. This bound is achievable ior A — V. 

The following result shows that under certain circum- 
stances one can use as a saddle point the state po mini- 
mizing TrjXpl together with a unitary operator Aq com- 
ing from the polar decomposition of Xp^ , 



(94) 



To express S, it will be convenient to introduce the com- 
pletely positive map $p defined by 

$t(r) = TTBBiXpTXD 

= ^Tr(CTFi'£;|£;jt/)pt/i'£;]T£;,Fp. (9^) 



Note also that 



XlXp = <fp(l). 



(96) 



Proposition 7. Let J\f be defined as in Eq. (157)) and $ 
a.s in Eq. (|95p . Let po be a state minimizing 



Foip) :=Tr./<i>,(l 



We he 



Fo(po) < maxF{TZU,id) < ^Fo{po) + 



(97) 



(98) 



// furthermore the optimal state po is unique and of full 
rank, then any channel of the form 



7^.(r) = $t [$^^^(1)-|^$^J1)-|] +r(r) 



(99) 



for some completely positive map T, is near-optimal in 
the sense that it satisfies 



Fo{po)<F{ngU,id). 



(100) 



Proof. The inequalities involving Fq{pq) follow directly 
from Proposition 1711 In view of the previous discus- 
sion, we only need to prove that the optimal state po 
together with an operator Aq solving Xp^ — Ao\Xpg\ 
form a saddle point for KeTr^AXj,). It is clear from 
the definitions that this pair {Aq,pq) attains the value 
miup max||^||=i Re Tr{AXj^). But we do not a priori know 
which solution Ao of the equation Xp^ — A\Xpg \ is such 
that pq is the minimum for ReTr(AoX^), which is what 
is needed for {Ao,po) to be a saddle point. We show in 
Appendix [B] that if po has full rank, then for any state 



p' , ReTr(AoXp/) is independent of which solution Aq of 
Xpg — A\Xpg\ we choose. Hence, any of them yields a 
near-optimal correction channel. □ 
Note that the condition that po be full-rank is neces- 
sary for the above construction to work. A counterexam- 
ple to this construction for the case when po is not of full 
rank is presented in Appendix [Cl 



C. Nature of the approximately correctable 
channels 

It would seem natural that an approximately cor- 
rectable channel is also close to some exactly correctable 
channel. Here we will show a slight variation of this in- 
tuition in terms of the worst-case entanglement fidelity, 
namely that an approximately correctable channel is al- 
ways similar to one which is close to an exactly cor- 
rectable channel. 

Theorem 8. For any channel Af, there exists an exactly 
correctable channel Aq and a channel M' ^ M such that 



Proof. Let 



d{Af',Afa) = mmd{TZAf,id). 



e := n^ind(7^7V,id). 



(101) 



(102) 



To prove that d{J\f',J\fo) > e, consider the channel TZq 
correcting Afo- Using first the monotonicity of the dis- 
tance and then Corollary [H we conclude 

d(7V',7Vo) > d(7^oA/'^ id) > mmd{nAf\ id) = e. (103) 

For the converse, observe that by Corollary [5] there 
exists a constant channel C, 



C{p) = crTr(p), 

such that 

d{Sf,C)^e. 
Using Theorem 13] with d = 1, 



(104) 



(105) 



e = d{M,C) > min d{njV,C) = min d{Af,nC). (106) 

|i?| = l l-R| = l 

Let Rq be the optimal trace-nonincreasing CP map on 
the right-hand side. It is of the form TZo{p) — ApA'' 
where A''A< 1. Consider the isometry 



U := A® |0) + Vl - A^A(E)\1) (107) 
with corresponding map V{p) := VpV'^ . We have 

e > d{N, TZqC) = d{Af', VC) (108) 

where 

AA'(p) :=AA(p)® |0)(0|. (109) 
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Clearly M' ~ M . In addition, we know from the Knill- 
Laflamme conditions (see Section II Dp that C is exactly 
correctable. Therefore Nq = VC is also exactly cor- 
rectable. ^ □ 
Let us find a channel C complementary to C explicitly. 
If cr = Y^iPMm, we have 

C{p) = ^KN)01pb-)(*| = TrEWpW\ (110) 

ij 

where 

W^Y.VP^\^^{J\^\^J)E■ (111) 

ij 

The corresponding complementary channel is 

in' 

(112) 

Hence we see that the exactly correctable channel Mq in 
the above proof has the form 

Mo{p)^V{p®a)V\ (113) 

which is indeed the general form of a correctable chan- 
nel [m. 

D. State discrimination 

To illustrate the generality of our result, let us show 
how it yields a nontrivial result for minimax state dis- 
crimination (32j . 

We want to specialize the relation 

]^d[NSM) < mmd{'RJ\f,M) < d{N,NM), (114) 

where d{M,M) = ^1 - F{J\f,M) and F{J\f,M) is the 
worst-case entanglement fidelity between channels Af and 
M, to the case where 

Mip)=J2\'){^\p\^){^\ (115) 

i 

and 

^p) = J2p^('\p\') -T.4\j){^\p\^){jh (116) 

i ij 

for a fixed set of states {pi}, where pi = s|si. This is 
the problem of minimax state discrimination [32j . In- 
deed, this channel J\f can be thought of as a classical-to- 
quantum channel, which is just what a state preparation 
is: it outputs a quantum state depending on some clas- 
sical data, namely, the choice of which pi to output. 

Since the output of A4 is diagonal in the basis we 
also expect the output of the optimal channel TZ to be. 



in which case there is a positive operator- valued measure 
(POVM) with elements Ai (J2i ^i = 1) such that 

7^(p)=5]Tr(pA,)|^)(^|. (117) 

i 

We then have 



uia^F{TZAf,M) — max min /^PjTr(pjAi), 

^ i \j j 

(118) 

where pi = {i\p\i) is all that matters about the initial 
state p. Since the classical fidelity is jointly concave in 
both arguments, the minimum of p is achieved when p is 
pure. Hence, 

max F{nAf,M) = max min y/Tr{p,A,). (119) 

The square of this expression is the minimal worst-case 
success probability for the discrimination of the states pi. 

In order to see what the bound F{J\f,J\fA4) is, we need 
channels complementary to Af and Ai , or the correspond- 
ing isometrics V and W. If 

= (120) 

i 

then clearly M{p) — TteWpW'' . Hence, we can pick 
M TtbWpWI Similarly, if 

= (121) 

ij 

then A/'(/5) = TteVpV^ and so we can use A/" ;= 
TiBVpV^. The worst-case entanglement fidelity 
F{Af,AfA4) then can be written as 

F{M,MM) = minmax|/(t/,p)|, (122) 

il) u 

where 



f{U,p)^ 




(123) 
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with 1-0) being a purification of p, and 

X^Y.^\p\i)p,®\i){Q\. (124) 



In the above, we used the notation 



E 




A 



(125) 



i 

We also used that fact that 



(126) 



Therefore, 



and 



F{N,NM.) = min 
p 



Hence, the quantity 



A:=l-mmTrJ5]^p2^2 



satisfies 
1 



-A < min max 

4 A 4 



1 - VTr(p.A,) 



< A, 



(127) 



(128) 



(129) 



(130) 



I.e. 



Ia-— A^ < min max [1 - Tr(p,^i)] < 2A-A^ (131) 

2 16 A i 

This provides a simple estimate to the optimal achievable 
solution to the minimax state discrimination problem. 

We note that the same upper bound, and a better lower 
bound exactly equal to A, can also be derived fio'l by 
applying the minimax theorem to previously obtained 
state discrimination bounds ISSMSal. 



IV. COMPARISON WITH OTHER RESULTS 

Results similar to ours exist in the special case M. — 
id [qI , [ToI . or in the particular case of state discrimina- 
tion |33| . In these works, bounds are derived for the en- 
tanglement fidelity for a fixed state, but a direct applica- 
tion of the minimax theorem yields bounds for the worst- 
case entanglement fidelity flOl|. However, it is not known 



whether there exists an efficient procedure for construct- 
ing near-optimal recovery channels compatible with the 
worst-case bounds obtained in this way. 

Let us show that our method also works for the fixed- 
state entanglement fidelity, at least for the main theorem. 
Then we will see that it yields almost the same bounds 
as in Ref. [l3| in the case Ai = id, albeit weaker. 

It is easy to show that both Theorems[3]and[5]still hold 
if the worst-case fidelity F is replaced by the fidelity Fp 
for a fixed input state p [l[ . The proofs are much simpler 
as the minimum disappears from Eq. (j58p . Hence, it 



suffices to see that max||^||<in^/||<i ^(yl, A') is equal to 
both sides of the equation 

max Fp{nM,M) = max Fp{M,n'M) (132) 

\lZ\<d \lZ'\<d 

when M and Ai are complementary, respectively, to A/" 
and From this it follows also that \i M ^ N and 
M ^ M, then 



max Fp (7^7V, M ) = nmx Fp (TV, WM). 



(133) 



However, since Fp does not have the special property 
expressed in Eq. (|74p . we cannot use the same technique 
to get a simple approximation of max^j/ Fp(M ,TZ' M) in 
the case M."^ = M.. However, we can obtain an inequality 
similar to Eq. ((7T|) for the important case 



M{a)^pTY{a), 



(134) 



where p is the same state as the one used to evaluate the 
fidelity. This channel M. is generalized complementary 
to = id. Hence, this corresponds to the approximate 
quantum error correction problem. 

Concretely, suppose that 7?.q is such that 

dp(A/■,7^^,(p)Tr) = min^^p(AA,7^'(p)Tr) = Eq. (135) 

Then, using the triangle inequality, we have 

dp{N,N{p)i:T) < eo + dp(7e[,(p)Tr, AA(p)Tr). (136) 
The second term is calculated from 

Fp(7^^,(p)Tr,A^(p)Tr) = /(7^^,(p), AA(p)) 

<Fp{K[,n',{p)Tr) = l-el. 

(137) 

Hence, dp{J\f,J\f{p)TT) < 2eo, from which we obtain the 
estimate 

^dp{JV,j\f{p)Ti-) < mindp(7^A/',id) < dp{Af,Af{p)Tr). 

(138) 

This is a weaker form of the bounds derived by Tyson 
(Eq. (153) of Ref. ^) using a different method. The 
upper bound can be seen to be equivalent to the cor- 
responding bound in Ref. [loj . but the lower bound is 
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weaker, which may be significant in the regime where 
the optimal error is large. 

Indeed, if we write the estimate explicitly in terms of 
some Kraus operators Ei of Af, we have 



i^p(AA, AA(p)Tr) = Tr E,p^E}Tr{pE} E,). (139) 
y ij 

This is precisely the quantity that Tyson denotes by A 
(Eq. (154) of Ref. [To^) (note that the quantity that Tyson 
calls 'fidelity' is the square of our fidelity). Tyson writes 
the channel as Af{p) = ^iPiFipF} , where Tv{pF}Fj) = 
Sij and Pi > 0, which is always possible. Hence, we obtain 
his formula for A using Ei — ^^Fi. 

The corresponding near-optimal channel that we ob- 
tain in this way is the same as the one introduced 
by Tyson, which is TZg defined in Eq. ([99]) but with 
Pq = a = p, i.e., 

7^g(r) = ^ $(1)"5] + r(r), (140) 

where 

^\t) ^^{pEIe,)pE}tE,p, (141) 

ij 

and T is any CP map that makes TZg trace-preserving. 
For instance, T can be chosen as 

r(r) = Tr(TP)(j, (142) 

where P projects on the kernel of $(1), and a can be any 
state. 



As explained in Ref. |lO|, this channel is not the same 
as the one used in Ref. [9|, which yields similar bounds 
and was introduced by Petz [sl, [13] who showed that it 
yields exact inversion on two given states. The latter is 
built in the same way, but from the CP map 

^HT) = ^Af^{T)^. (143) 

The performance of this channel with p maximally mixed 
(known as the "transpose channel" ) was also studied for 
approximate QEC in terms of the worst-case fidelity in 
Ref. l3- 
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Appendix A: Proof of the relation 

ReTr(VU|Xp|) <Tr]Xp] 

Let {\i)} be an eigenbasis of \Xp\, \Xp\ = J^j 
Xj > 0. Since V is unitary and ||j4|| < 1, we have 
WV^fAW < < 1. This means that for all j, 

\\VU\j)r = {j\A^VV^A\j) = Y.^J\A^^\^){^\V^A\J) < 1. 

(Al) 

Therefore, 

\{j\V^A\j)\<l, Vj. (A2) 

We thus have 

ReTr(yt^|X,|) = Re ^(j|yt^|^>^. < Y^\{j\V^ A\j)\x, 

j 3 

<^x, sTr|Xp|. 

Appendix B: The saddle point in the case of a 
unique, full-rank po 

Our procedure for constructing a near-optimal re- 
covery channel requires finding a saddle point (poi^o) 
of Regp(^, [/'), where U' yields U through N{p) = 
Tr2(C/'(p ® |0)(0|)(J7')^). In the case of M{p) = p®a, 
we saw that this is equivalent to finding (po,^o) that 
achieves the optimization miup max||^||<i ReTr(v4X^) = 
minpTrjXpj. One way to approach the problem in 
this case could be to first search for po that achieves 
minpTrj^Tpl, which is a convex optimization task. If we 
find po that is unique, then we know that it must be 
the one at the saddle point. Now imagine that this po is 
also of full rank. We will prove that in such a case the 
staddlc-point can be taken to be Aq = Uq, where Uq 
is any unitary that comes from the polar decomposition 
of ATpo, Xpg = Uo\XpJ. 

Clearly, the unitary Uq achieves the maximum 
in max|]^j]<i ReTr(y4Xt^), because KeTr{UoXj,^^) — 

Re Tr(C/o|Arp(, |C/q ) = TrjATp^l, but we also need that po 
achieves the minimum in miup Re Tr(?7oAr^). If C^o is 
the unique maximizer of max||^||<i }ieTY{AXj,^^ ), then we 
know that it must be a saddle point. The problem is that 
in general Aq need not be unique. However, we will see 
that if Po is of full rank, Aq is unique up to a freedom 
that is irrelevant for the value of ReTi{AXj,) whose sad- 
dle point we are looking for. Hence, any operator ^o that 
maximizes ReTr{AXj^^) would yield a saddle point. 
To show this, let us characterize the operators A, 
< 1, that satisfy 

ReTr(AXtj = Tr|XpJ. (Bl) 

Eq. (|B1[) can be equivalently written as 

ReTr(i|XpJ) = Tr|Xp„|, (B2) 
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where (using the cyclic invariance of the trace) 



A UlA. 



(B3) 



Note that ||i|| < ||C/o^||||A|| < 1. 

Let i = 1, . . . ,d, be eigenvectors of \Xp^ \ ordered 
such that the first n < d oi them are all those with 
nonzero eigenvalues, |Xpo| = X]i<n ^* > 0. Then 

Eq. (jB2[) is equivalent to 



Re '^{i\A\i)xi = 



or 



^(1-Re(i|i|i))2;, =0. 



(B4) 



(B5) 



Since ||^|| < 1, we have < 1 for all i < n. There- 

fore, the above is satisfied if and only if 

{i\A\i) = 1, Vi < n. (B6) 

From ||A|| < 1 we also have that for all j < n, 

\\A\j)r = {j\A^A\j) = J2{j\A^\z)mj) 

^i+j2{j\A^\z)mj)<i, 

which is only possible if 

(z|i|j)=0, yt^j,j<n. (B8) 
Since ||^^|| ~ \\A\\ < 1, we obtain via the same argument 

{i\A\j)^0, yi^j,i<n. (B9) 

This implies that A is block diagonal in the basis {\i)}, 
with the upper block (corresponding to the first n basis 
vectors) equal to 1. This is necessary and sufhcient for 
our condition to hold. Let us write 



A = 



1 

B 



(BIO) 



for some matrix B and 1 = X]i<n K) (bl- 
ooming back to A itself, and labeling the blocks of Uq 
by we have that 



A = UqA = 



Uii Ui2\ (1 



U21 U2 



B 



Uu U12B 
U21 U22B 



(Bll) 

Since the left block column of A is unique, we know that 
it is equal to that of ^0 which sits at the saddle point. 

Now, suppose that we replace Xp^ by Xp which is such 
that the support of \Xp\ is within the support of |^po|- 
Then let us show that only the left block column of A 
as defined above would matter for the calculation of the 
pseudo- fidelity 

ReTr(^Xt) =ReTr(yU|Xp|), (B12) 



where Xp = V\Xp\. Indeed, since the only nonzero com- 
ponents of \Xp\ are in the upper left block, for the trace 
in the last expression only the upper left block of V'^ A 
would matter, and this block is 

(yU)n = y/i^ii + 1^21^21 = l^^C/ii + V^^U2i, (B13) 

where Vpi, are the corresponding blocks of V . We see that 
(y^A)ii is independent of any freedom we may have in 
choosing A and therefore behaves just like for the saddle- 
point n 
For M{p) — a, we can write l-'i^poP in the form 



(B14) 



where Ei are suitable Kraus operators of A/", and = 
Tv{E.iaEl) [M{p) = cr Tr(p)]. It is easy to see that a state 
\ip) is in the kernel of l-'i^poP (and therefore of l-'fpol) if 
and only if, for all i, Xi = or pQEj\ip) — 0. If po is full- 
rank, the last condition reads Ej\tp) = 0. This means 
that for any p, the kernel of \Xp\ contains the kernel 
of l^pol, or equivalently, the support of l^'^pol contains 
the support of \Xp\ for all p. By the above argument, 
Aq ~ Uq would be the unique maximizer of ReTr(AX^^) 

up to the freedom in the way A = UqA acts on the 
kernel of \Xp\, which has no relevance for the value of 
ReTr(A|Xp|) = ReTr(ylXtJ. Hence, (C/o,Po) is a sad- 
dle point of ReTr(AXt). 



Appendix C: Inadequacy of the above procedure 
when Po is not of full rank 

Unfortunately, the above argument cannot be used to 
simplify the procedure in the general case, since in princi- 
ple Po need not be unique (take, for example, the extreme 
case where Af is correctable), and even if it is unique, it 
need not be of full rank. Let us illustrate the latter case 
by an example. 

Let A/" be a channel with a 2-dimensional input 
(we will denote the input system by A) and a 2- 
dimensional output (denoted by E) with basis vectors 
{10)^,11)-^} and {|0)-^, |1)-^}, respectively, that acts as 
follows: JV{p^) = (1 - s)p^ + s|0)(0|^Tr(p^). Take 
7W(p^) = |0)(0|^Tr(p'4). Note that = (1 - s)l sA? 



and J\fA4 = M.. Let \i^p)^^ be a purification of p^ 



From the concavity of the square of the fidelity |2£ 
have 



we 



p ^ 

= mmF^iil - s)\i;){^f^ + sTW ® id^(|i^)(V'|^^), 
p 

> min[(l-s)F2(|V;)(V'|^«,M®id^dV')(^|'^'^)) + 
p 

sF^{M^id^\i;){^P\^'^,M^id^{\i;){^P\^^))] > s. (CI) 
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We will show that the lower bound s is actually achiev- 
able for Pq — |1)(1|'^. Indeed, in this case we can take 
\^)AR ^ |1)^|1)«. We then have M id^(|V')(V'l'^^) = 
|0)(0|^ ® We obtain 

= (0|^(1|«((1-.)|1)(1|^®|1)(1|«+ 

s|0)(0|^®|l)(l|«)|0)^|l>^ = ,s. (C2) 

Moreover, it is easy to see that Pq = |1)(1|"^ is the unique 
state that achieves the minimum value. 

The state Pq = |1)(1|'^ does not have full support. To 
show that this does not allow us to obtain a saddle-point 
Aq by simply taking any maximizer of max^i gpg{A, [/'), 
let us look at the support of l^'^pol as a function of p. 
Since J\f has three Kraus operators, Eq = vT~~sl, 
= \/s|0)(0|, E2 = ys|0)(l|, we will take system B 
in the circuit diagram to be 3-dimensional, with basis 
{[O-^), |1^), |2^)}. The dilation of AA (or TV) is 

|0)^^ VT^|0)^|0)^ + yi|0)^|l)^, (C3) 
|1)^^ VT^|1)^|0)^ + V5|0)^|2)^. (C4) 

From this, we obtain the Kraus operators of Af, 

So = \/T^|0)(0| + \/i|l)(0H-V^|2)(l|, (C5) 

£;i = VT^|o)(i|. (C6) 

Using the expression for Xj^Xp in terms of an arbitrary 
choice of Kraus operators, 

Xj,Xp = E,P^E]Tv{E,aEl), (C7) 

we obtain (in our case a = |0)(0|) 

XlXp - Eop'El (C8) 
For p = po = we have 

|X,J^=.|2)(2|. (C9) 
However, if we take, for example, p = |0)(0|, we obtain 

^|o>(o|^|o>(o| = l'/')('^L (CIO) 



where 

\cj,) = VT~s\0) + ^,\l). (Cll) 

We therefore see that the support of \Xpg\ in this case 
does not contain the support of \Xp\ for all p. 

Let us show that due to this fact, not every unitary 
one obtains from the polar decomposition of is a 
saddle point. Denote the left and right ancilla systems (in 
state |0)) displayed in the main circuit diagram by C (of 
dimension 2) and C (of dimension 3), respectively, and 
denote the system corresponding to the middle wire by D 
(the latter has dimension 3). Now, consider the case p^ = 
|1)(1|'4 (the state |V'p)^^ can be taken to be |1)^|1)-"). 
The action of the isometry Vj^ is given by (|C4p . and U' 
in this case realizes this isometry on the input E' when 
the ancilla C is in state |0)'-^ . The input (from the right) 
at E' is in the state |0)^ since by definition this is the 
output of A4. Using that, one easily obtains that the 
overlap reduces to 

<7|i)(i|(C/, U') - yi(l|^'(<^|^[/|2)^|0)^. (C12) 
The real part of .g|i)(i|([/, U') is maximized when 

i7|2)^|0)^ = |1)^V)^. (C13) 

However, we have a freedom of choosing how U acts on 
Span{|0)^|0)^|l)^|0)'^}. To see that not every unitary 
satisfying (jC13p yields a saddle point, consider the case 
of = |0)(0|'^. In this case, the overlap reduces to 

5|o>(o|(C/,C/')-(O|^'(0rC/|0)^|O)^. (C14) 

Since the action of [/ on \(j))^\0)'^ is completely undeter- 
mined by condition (IC13p . we could choose Uq such that 
it satisfies both (jC13|) and, e.g., 

C/o|</>)^|O)^ = -|O)^'|0)^. (C15) 

Then we obtain that Re g\Q^Q\{Uo,U') = — 1 < 
^'^9\i){i\{Uq,U') = y/s, i.e., (C/o,|l)(l|) is not a saddle 
point of Iiegp{A, U') since po = |1)(1| does not minimize 
Regp{Uo,U'). 



